Robust Tests in Online Decision-Making

Abstract

Bandit algorithms are widely used in sequential decision problems to maximize the cumulative reward. One potential application is mobile health, where the goal is to promote the user's health through personalized interventions based on user-specific information acquired through wearable devices. Important considerations include the type of data collected and the frequency with which it is collected (e.g., GPS or continuous monitoring), as such factors can severely impact app performance and users' adherence. In order to balance the need to collect data that is useful with the constraint of impacting app performance, one needs to be able to assess the usefulness of variables. Bandit feedback data are sequentially correlated, so traditional testing procedures developed for independent data cannot be applied. Recently, a statistical testing procedure was developed for the actor-critic bandit algorithm. An actor-critic algorithm maintains two separate models: one for the actor, the action selection policy, and the other for the critic, the reward model. The performance of the algorithm as well as the validity of the test are guaranteed only when the critic model is correctly specified. However, misspecification is frequent in practice due to an incorrect functional form or missing covariates. In this work, we propose a modified actor-critic algorithm that is robust to critic misspecification and derive a novel testing procedure for the actor parameters in this case.
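
The actor-critic structure described in the abstract can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper and does not include its robust modification or its testing procedure; it is a generic two-action contextual bandit written under assumptions. The simulated environment, the linear critic, the logistic actor, the learning rates, and the function name `run_actor_critic_bandit` are all hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def run_actor_critic_bandit(n_rounds=5000, seed=0, lr_actor=0.05, lr_critic=0.05):
    """Generic actor-critic contextual bandit with two actions (illustrative only)."""
    rng = random.Random(seed)
    theta = [0.0, 0.0]            # actor: P(action = 1 | x) = sigmoid(theta . x)
    w = [[0.0, 0.0], [0.0, 0.0]]  # critic: one linear reward model per action
    for _ in range(n_rounds):
        x = [1.0, rng.uniform(-1.0, 1.0)]  # context: intercept plus one feature
        p1 = sigmoid(dot(theta, x))
        a = 1 if rng.random() < p1 else 0
        # Hypothetical environment: action 1 pays x[1], action 0 pays 0, plus noise.
        reward = (x[1] if a == 1 else 0.0) + rng.gauss(0.0, 0.1)
        # Critic step: stochastic gradient update on the squared prediction error.
        err = reward - dot(w[a], x)
        w[a] = [wi + lr_critic * err * xi for wi, xi in zip(w[a], x)]
        # Actor step: gradient ascent on the critic's estimate of expected reward,
        # p1 * Q1(x) + (1 - p1) * Q0(x), whose gradient is p1 * (1 - p1) * (Q1 - Q0) * x.
        advantage = dot(w[1], x) - dot(w[0], x)
        scale = p1 * (1.0 - p1) * advantage
        theta = [ti + lr_actor * scale * xi for ti, xi in zip(theta, x)]
    return theta, w


theta, w = run_actor_critic_bandit()
p_high = sigmoid(dot(theta, [1.0, 0.8]))   # context where action 1 is better
p_low = sigmoid(dot(theta, [1.0, -0.8]))   # context where action 0 is better
print(p_high, p_low)
```

In this toy setting the critic's linear model matches the true reward, so the actor learns to prefer action 1 when the feature is positive. The abstract's concern is the opposite case, in which the critic is misspecified and the guarantees of the standard procedure no longer hold.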

Similar resources

Provenance for Online Decision Making

It is commonly believed that provenance can be utilised to form assessments about the quality, reliability or trustworthiness of data. Once presented with contradictory or questionable information, users can seek further validation by referring to its provenance. While there has been some effort to design principled methods to analyse provenance, the focus has mostly been on offline use of prov...

Robust Multi-Stage Decision Making

Testifying to more than ten years of academic and practical developments, this tutorial attempts to provide a succinct yet unified view of the robust multi-stage decision making framework. In particular, the reader should better understand: (1) the distinction between static versus fully or partially adjustable decisions; (2) the root of tractability issues; (3) the connection to robust dynamic...

Robust Decision-making Under Ambiguity

Most of management research, following on the paradigm of expected utility theory, has developed complex models of optimal managerial action in the presence of uncertainty. Still, the assumption that managers assign probabilities to outcomes, and consequently optimize their actions has come under criticism from a number of empirical studies. Researchers, working on probability assessments, have...

Online Decision Making in VR Application Environments

The aim of this paper is to understand the process by which consumers' perception of online VR environments impacts their purchase decision. Combining factor and process models, we propose a transaction framework suggestive of consumer decision-making in VR e-commerce environments. The framework is informed by theory to be validated by an experimental design to understand the antecedents and con...

Online Decision-Making in General Combinatorial Spaces

We study online combinatorial decision problems, where one must make sequential decisions in some combinatorial space without knowing in advance the cost of decisions on each trial; the goal is to minimize the total regret over some sequence of trials relative to the best fixed decision in hindsight. Such problems have been studied mostly in settings where decisions are represented by Boolean v...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2022

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v36i9.21240